Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 506 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 108.3 KiB |
| Average record size in memory | 219.2 B |
Variable types
| Categorical | 2 |
|---|---|
| Numeric | 18 |
TOWN has a high cardinality: 92 distinct values | High cardinality |
MEDV is highly correlated with CMEDV | High correlation |
CMEDV is highly correlated with MEDV | High correlation |
RAD is highly correlated with TAX | High correlation |
TAX is highly correlated with RAD | High correlation |
TRACT has unique values | Unique |
ZN has 372 (73.5%) zeros | Zeros |
Reproduction
| Analysis started | 2021-02-20 09:10:48.510233 |
|---|---|
| Analysis finished | 2021-02-20 09:12:06.018326 |
| Duration | 1 minute and 17.51 seconds |
| Software version | pandas-profiling v2.10.1 |
| Download configuration | config.yaml |
| Distinct | 92 |
|---|---|
| Distinct (%) | 18.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.2 KiB |
| Cambridge | 30 |
|---|---|
| Boston Savin Hill | 23 |
| Lynn | 22 |
| Boston Roxbury | 19 |
| Newton | 18 |
| Other values (87) |
Length
| Max length | 23 |
|---|---|
| Median length | 9 |
| Mean length | 9.9743083 |
| Min length | 4 |
Characters and Unicode
| Total characters | 5047 |
|---|---|
| Distinct characters | 41 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 17 ? |
|---|---|
| Unique (%) | 3.4% |
Sample
| 1st row | Nahant |
|---|---|
| 2nd row | Swampscott |
| 3rd row | Swampscott |
| 4th row | Marblehead |
| 5th row | Marblehead |
| Value | Count | Frequency (%) |
| Cambridge | 30 | 5.9% |
| Boston Savin Hill | 23 | 4.5% |
| Lynn | 22 | 4.3% |
| Boston Roxbury | 19 | 3.8% |
| Newton | 18 | 3.6% |
| Somerville | 15 | 3.0% |
| Boston South Boston | 13 | 2.6% |
| Boston East Boston | 12 | 2.4% |
| Brookline | 12 | 2.4% |
| Quincy | 12 | 2.4% |
| Other values (82) | 330 |
| Value | Count | Frequency (%) |
| boston | 157 | |
| cambridge | 30 | 4.2% |
| hill | 26 | 3.6% |
| savin | 23 | 3.2% |
| roxbury | 23 | 3.2% |
| lynn | 22 | 3.1% |
| newton | 18 | 2.5% |
| somerville | 15 | 2.1% |
| south | 13 | 1.8% |
| east | 12 | 1.7% |
| Other values (87) | 375 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 618 | 12.2% |
| n | 465 | 9.2% |
| t | 389 | 7.7% |
| e | 378 | 7.5% |
| a | 270 | 5.3% |
| r | 264 | 5.2% |
| s | 254 | 5.0% |
| l | 250 | 5.0% |
| B | 220 | 4.4% |
| i | 219 | 4.3% |
| Other values (31) | 1720 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4109 | |
| Uppercase Letter | 722 | 14.3% |
| Space Separator | 208 | 4.1% |
| Dash Punctuation | 8 | 0.2% |
Most frequent character per category
| Value | Count | Frequency (%) |
| o | 618 | |
| n | 465 | |
| t | 389 | |
| e | 378 | |
| a | 270 | 6.6% |
| r | 264 | 6.4% |
| s | 254 | 6.2% |
| l | 250 | 6.1% |
| i | 219 | 5.3% |
| d | 134 | 3.3% |
| Other values (13) | 868 |
| Value | Count | Frequency (%) |
| B | 220 | |
| S | 75 | 10.4% |
| W | 65 | 9.0% |
| C | 48 | 6.6% |
| H | 44 | 6.1% |
| M | 43 | 6.0% |
| R | 42 | 5.8% |
| N | 41 | 5.7% |
| L | 31 | 4.3% |
| D | 30 | 4.2% |
| Other values (6) | 83 | 11.5% |
| Value | Count | Frequency (%) |
| 208 |
| Value | Count | Frequency (%) |
| - | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4831 | |
| Common | 216 | 4.3% |
Most frequent character per script
| Value | Count | Frequency (%) |
| o | 618 | |
| n | 465 | 9.6% |
| t | 389 | 8.1% |
| e | 378 | 7.8% |
| a | 270 | 5.6% |
| r | 264 | 5.5% |
| s | 254 | 5.3% |
| l | 250 | 5.2% |
| B | 220 | 4.6% |
| i | 219 | 4.5% |
| Other values (29) | 1504 |
| Value | Count | Frequency (%) |
| 208 | ||
| - | 8 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5047 |
Most frequent character per block
| Value | Count | Frequency (%) |
| o | 618 | 12.2% |
| n | 465 | 9.2% |
| t | 389 | 7.7% |
| e | 378 | 7.5% |
| a | 270 | 5.3% |
| r | 264 | 5.2% |
| s | 254 | 5.0% |
| l | 250 | 5.0% |
| B | 220 | 4.4% |
| i | 219 | 4.3% |
| Other values (31) | 1720 |
TOWNNO
Real number (ℝ≥0)
| Distinct | 92 |
|---|---|
| Distinct (%) | 18.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47.53162055 |
|---|---|
| Minimum | 0 |
| Maximum | 91 |
| Zeros | 1 |
| Zeros (%) | 0.2% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 26.25 |
| median | 42 |
| Q3 | 78 |
| 95-th percentile | 86.75 |
| Maximum | 91 |
| Range | 91 |
| Interquartile range (IQR) | 51.75 |
Descriptive statistics
| Standard deviation | 27.57140124 |
|---|---|
| Coefficient of variation (CV) | 0.580064406 |
| Kurtosis | -1.318219483 |
| Mean | 47.53162055 |
| Median Absolute Deviation (MAD) | 21.5 |
| Skewness | 0.03920476703 |
| Sum | 24051 |
| Variance | 760.1821665 |
| Monotocity | Increasing |
| Value | Count | Frequency (%) |
| 28 | 30 | 5.9% |
| 83 | 23 | 4.5% |
| 4 | 22 | 4.3% |
| 82 | 19 | 3.8% |
| 40 | 18 | 3.6% |
| 27 | 15 | 3.0% |
| 80 | 13 | 2.6% |
| 59 | 12 | 2.4% |
| 45 | 12 | 2.4% |
| 79 | 12 | 2.4% |
| Other values (82) | 330 |
| Value | Count | Frequency (%) |
| 0 | 1 | 0.2% |
| 1 | 2 | 0.4% |
| 2 | 3 | 0.6% |
| 3 | 7 | 1.4% |
| 4 | 22 | |
| 5 | 4 | 0.8% |
| 6 | 2 | 0.4% |
| 7 | 9 | |
| 8 | 4 | 0.8% |
| 9 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 91 | 5 | 1.0% |
| 90 | 8 | 1.6% |
| 89 | 5 | 1.0% |
| 88 | 4 | 0.8% |
| 87 | 4 | 0.8% |
| 86 | 7 | 1.4% |
| 85 | 6 | 1.2% |
| 84 | 11 | |
| 83 | 23 | |
| 82 | 19 |
| Distinct | 506 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2700.357708 |
|---|---|
| Minimum | 1 |
| Maximum | 5082 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 430.5 |
| Q1 | 1303.25 |
| median | 3393.5 |
| Q3 | 3739.75 |
| 95-th percentile | 4202.75 |
| Maximum | 5082 |
| Range | 5081 |
| Interquartile range (IQR) | 2436.5 |
Descriptive statistics
| Standard deviation | 1380.03811 |
|---|---|
| Coefficient of variation (CV) | 0.5110575188 |
| Kurtosis | -1.196098004 |
| Mean | 2700.357708 |
| Median Absolute Deviation (MAD) | 787 |
| Skewness | -0.4358094348 |
| Sum | 1366381 |
| Variance | 1904505.185 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1 | 0.2% |
| 3733 | 1 | 0.2% |
| 3746 | 1 | 0.2% |
| 3745 | 1 | 0.2% |
| 3744 | 1 | 0.2% |
| 3743 | 1 | 0.2% |
| 3742 | 1 | 0.2% |
| 3741 | 1 | 0.2% |
| 3740 | 1 | 0.2% |
| 3739 | 1 | 0.2% |
| Other values (496) | 496 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 101 | 1 | |
| 102 | 1 |
| Value | Count | Frequency (%) |
| 5082 | 1 | |
| 5081 | 1 | |
| 5071 | 1 | |
| 5062 | 1 | |
| 5061 | 1 | |
| 5052 | 1 | |
| 5051 | 1 | |
| 5041 | 1 | |
| 5031 | 1 | |
| 5022 | 1 |
LON
Real number (ℝ)
| Distinct | 375 |
|---|---|
| Distinct (%) | 74.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -71.05638874 |
|---|---|
| Minimum | -71.2895 |
| Maximum | -70.81 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | -71.2895 |
|---|---|
| 5-th percentile | -71.202375 |
| Q1 | -71.093225 |
| median | -71.0529 |
| Q3 | -71.019625 |
| 95-th percentile | -70.936 |
| Maximum | -70.81 |
| Range | 0.4795 |
| Interquartile range (IQR) | 0.0736 |
Descriptive statistics
| Standard deviation | 0.07540534773 |
|---|---|
| Coefficient of variation (CV) | -0.001061204335 |
| Kurtosis | 1.108480767 |
| Mean | -71.05638874 |
| Median Absolute Deviation (MAD) | 0.0371 |
| Skewness | -0.2053847315 |
| Sum | -35954.5327 |
| Variance | 0.005685966467 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| -71.069 | 5 | 1.0% |
| -71.04 | 4 | 0.8% |
| -71.0455 | 4 | 0.8% |
| -71.02 | 4 | 0.8% |
| -71.075 | 4 | 0.8% |
| -71.055 | 4 | 0.8% |
| -71.059 | 4 | 0.8% |
| -71.03 | 4 | 0.8% |
| -71.034 | 3 | 0.6% |
| -71.046 | 3 | 0.6% |
| Other values (365) | 467 |
| Value | Count | Frequency (%) |
| -71.2895 | 1 | |
| -71.2807 | 1 | |
| -71.269 | 1 | |
| -71.2685 | 1 | |
| -71.263 | 1 | |
| -71.262 | 1 | |
| -71.2575 | 1 | |
| -71.255 | 1 | |
| -71.2475 | 1 | |
| -71.247 | 1 |
| Value | Count | Frequency (%) |
| -70.81 | 1 | |
| -70.83 | 2 | |
| -70.833 | 1 | |
| -70.8525 | 1 | |
| -70.853 | 1 | |
| -70.855 | 1 | |
| -70.86 | 1 | |
| -70.8875 | 1 | |
| -70.9075 | 1 | |
| -70.915 | 1 |
LAT
Real number (ℝ≥0)
| Distinct | 376 |
|---|---|
| Distinct (%) | 74.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42.21644032 |
|---|---|
| Minimum | 42.03 |
| Maximum | 42.381 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 42.03 |
|---|---|
| 5-th percentile | 42.10745 |
| Q1 | 42.180775 |
| median | 42.2181 |
| Q3 | 42.25225 |
| 95-th percentile | 42.31985 |
| Maximum | 42.381 |
| Range | 0.351 |
| Interquartile range (IQR) | 0.071475 |
Descriptive statistics
| Standard deviation | 0.06177718406 |
|---|---|
| Coefficient of variation (CV) | 0.001463344223 |
| Kurtosis | 0.1040024903 |
| Mean | 42.21644032 |
| Median Absolute Deviation (MAD) | 0.03625 |
| Skewness | -0.08667859819 |
| Sum | 21361.5188 |
| Variance | 0.00381642047 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 42.23 | 5 | 1.0% |
| 42.192 | 4 | 0.8% |
| 42.245 | 4 | 0.8% |
| 42.2075 | 4 | 0.8% |
| 42.188 | 4 | 0.8% |
| 42.225 | 3 | 0.6% |
| 42.223 | 3 | 0.6% |
| 42.2275 | 3 | 0.6% |
| 42.235 | 3 | 0.6% |
| 42.224 | 3 | 0.6% |
| Other values (366) | 470 |
| Value | Count | Frequency (%) |
| 42.03 | 1 | |
| 42.0485 | 1 | |
| 42.052 | 1 | |
| 42.059 | 2 | |
| 42.0675 | 1 | |
| 42.0725 | 2 | |
| 42.0735 | 1 | |
| 42.0775 | 2 | |
| 42.0795 | 1 | |
| 42.0825 | 1 |
| Value | Count | Frequency (%) |
| 42.381 | 1 | |
| 42.374 | 1 | |
| 42.3715 | 2 | |
| 42.3525 | 1 | |
| 42.346 | 2 | |
| 42.345 | 2 | |
| 42.3425 | 1 | |
| 42.34 | 1 | |
| 42.339 | 1 | |
| 42.3382 | 1 |
| Distinct | 229 |
|---|---|
| Distinct (%) | 45.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.53280632 |
|---|---|
| Minimum | 5 |
| Maximum | 50 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 10.2 |
| Q1 | 17.025 |
| median | 21.2 |
| Q3 | 25 |
| 95-th percentile | 43.4 |
| Maximum | 50 |
| Range | 45 |
| Interquartile range (IQR) | 7.975 |
Descriptive statistics
| Standard deviation | 9.197104087 |
|---|---|
| Coefficient of variation (CV) | 0.408165053 |
| Kurtosis | 1.495196944 |
| Mean | 22.53280632 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 1.108098408 |
| Sum | 11401.6 |
| Variance | 84.58672359 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 50 | 16 | 3.2% |
| 25 | 8 | 1.6% |
| 21.7 | 7 | 1.4% |
| 22 | 7 | 1.4% |
| 23.1 | 7 | 1.4% |
| 20.6 | 6 | 1.2% |
| 19.4 | 6 | 1.2% |
| 13.8 | 5 | 1.0% |
| 22.6 | 5 | 1.0% |
| 21.2 | 5 | 1.0% |
| Other values (219) | 434 |
| Value | Count | Frequency (%) |
| 5 | 2 | |
| 5.6 | 1 | 0.2% |
| 6.3 | 1 | 0.2% |
| 7 | 2 | |
| 7.2 | 3 | |
| 7.4 | 1 | 0.2% |
| 7.5 | 1 | 0.2% |
| 8.1 | 1 | 0.2% |
| 8.3 | 2 | |
| 8.4 | 2 |
| Value | Count | Frequency (%) |
| 50 | 16 | |
| 48.8 | 1 | 0.2% |
| 48.5 | 1 | 0.2% |
| 48.3 | 1 | 0.2% |
| 46.7 | 1 | 0.2% |
| 46 | 1 | 0.2% |
| 45.4 | 1 | 0.2% |
| 44.8 | 1 | 0.2% |
| 44 | 1 | 0.2% |
| 43.8 | 1 | 0.2% |
| Distinct | 228 |
|---|---|
| Distinct (%) | 45.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.52885375 |
|---|---|
| Minimum | 5 |
| Maximum | 50 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 10.2 |
| Q1 | 17.025 |
| median | 21.2 |
| Q3 | 25 |
| 95-th percentile | 43.4 |
| Maximum | 50 |
| Range | 45 |
| Interquartile range (IQR) | 7.975 |
Descriptive statistics
| Standard deviation | 9.182175882 |
|---|---|
| Coefficient of variation (CV) | 0.4075740374 |
| Kurtosis | 1.516783448 |
| Mean | 22.52885375 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 1.11091185 |
| Sum | 11399.6 |
| Variance | 84.31235393 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 50 | 16 | 3.2% |
| 25 | 8 | 1.6% |
| 21.7 | 7 | 1.4% |
| 23.1 | 7 | 1.4% |
| 19.4 | 6 | 1.2% |
| 20.6 | 6 | 1.2% |
| 22 | 6 | 1.2% |
| 17.8 | 5 | 1.0% |
| 21.2 | 5 | 1.0% |
| 19.3 | 5 | 1.0% |
| Other values (218) | 435 |
| Value | Count | Frequency (%) |
| 5 | 2 | |
| 5.6 | 1 | 0.2% |
| 6.3 | 1 | 0.2% |
| 7 | 2 | |
| 7.2 | 3 | |
| 7.4 | 1 | 0.2% |
| 7.5 | 1 | 0.2% |
| 8.1 | 1 | 0.2% |
| 8.2 | 1 | 0.2% |
| 8.3 | 2 |
| Value | Count | Frequency (%) |
| 50 | 16 | |
| 48.8 | 1 | 0.2% |
| 48.5 | 1 | 0.2% |
| 48.3 | 1 | 0.2% |
| 46.7 | 1 | 0.2% |
| 46 | 1 | 0.2% |
| 45.4 | 1 | 0.2% |
| 44.8 | 1 | 0.2% |
| 44 | 1 | 0.2% |
| 43.8 | 1 | 0.2% |
CRIM
Real number (ℝ≥0)
| Distinct | 504 |
|---|---|
| Distinct (%) | 99.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.613523557 |
|---|---|
| Minimum | 0.00632 |
| Maximum | 88.9762 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0.00632 |
|---|---|
| 5-th percentile | 0.02791 |
| Q1 | 0.082045 |
| median | 0.25651 |
| Q3 | 3.6770825 |
| 95-th percentile | 15.78915 |
| Maximum | 88.9762 |
| Range | 88.96988 |
| Interquartile range (IQR) | 3.5950375 |
Descriptive statistics
| Standard deviation | 8.601545105 |
|---|---|
| Coefficient of variation (CV) | 2.380376098 |
| Kurtosis | 37.13050913 |
| Mean | 3.613523557 |
| Median Absolute Deviation (MAD) | 0.22145 |
| Skewness | 5.223148798 |
| Sum | 1828.44292 |
| Variance | 73.9865782 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.01501 | 2 | 0.4% |
| 14.3337 | 2 | 0.4% |
| 0.57834 | 1 | 0.2% |
| 0.06127 | 1 | 0.2% |
| 0.03548 | 1 | 0.2% |
| 0.1403 | 1 | 0.2% |
| 0.03705 | 1 | 0.2% |
| 0.95577 | 1 | 0.2% |
| 0.11747 | 1 | 0.2% |
| 0.03537 | 1 | 0.2% |
| Other values (494) | 494 |
| Value | Count | Frequency (%) |
| 0.00632 | 1 | |
| 0.00906 | 1 | |
| 0.01096 | 1 | |
| 0.01301 | 1 | |
| 0.01311 | 1 | |
| 0.0136 | 1 | |
| 0.01381 | 1 | |
| 0.01432 | 1 | |
| 0.01439 | 1 | |
| 0.01501 | 2 |
| Value | Count | Frequency (%) |
| 88.9762 | 1 | |
| 73.5341 | 1 | |
| 67.9208 | 1 | |
| 51.1358 | 1 | |
| 45.7461 | 1 | |
| 41.5292 | 1 | |
| 38.3518 | 1 | |
| 37.6619 | 1 | |
| 28.6558 | 1 | |
| 25.9406 | 1 |
| Distinct | 26 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.36363636 |
|---|---|
| Minimum | 0 |
| Maximum | 100 |
| Zeros | 372 |
| Zeros (%) | 73.5% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 12.5 |
| 95-th percentile | 80 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 12.5 |
Descriptive statistics
| Standard deviation | 23.32245299 |
|---|---|
| Coefficient of variation (CV) | 2.052375864 |
| Kurtosis | 4.031510084 |
| Mean | 11.36363636 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.225666323 |
| Sum | 5750 |
| Variance | 543.9368137 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 372 | |
| 20 | 21 | 4.2% |
| 80 | 15 | 3.0% |
| 12.5 | 10 | 2.0% |
| 25 | 10 | 2.0% |
| 22 | 10 | 2.0% |
| 40 | 7 | 1.4% |
| 30 | 6 | 1.2% |
| 45 | 6 | 1.2% |
| 90 | 5 | 1.0% |
| Other values (16) | 44 | 8.7% |
| Value | Count | Frequency (%) |
| 0 | 372 | |
| 12.5 | 10 | 2.0% |
| 17.5 | 1 | 0.2% |
| 18 | 1 | 0.2% |
| 20 | 21 | 4.2% |
| 21 | 4 | 0.8% |
| 22 | 10 | 2.0% |
| 25 | 10 | 2.0% |
| 28 | 3 | 0.6% |
| 30 | 6 | 1.2% |
| Value | Count | Frequency (%) |
| 100 | 1 | 0.2% |
| 95 | 4 | 0.8% |
| 90 | 5 | 1.0% |
| 85 | 2 | 0.4% |
| 82.5 | 2 | 0.4% |
| 80 | 15 | |
| 75 | 3 | 0.6% |
| 70 | 3 | 0.6% |
| 60 | 4 | 0.8% |
| 55 | 3 | 0.6% |
INDUS
Real number (ℝ≥0)
| Distinct | 76 |
|---|---|
| Distinct (%) | 15.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.13677866 |
|---|---|
| Minimum | 0.46 |
| Maximum | 27.74 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0.46 |
|---|---|
| 5-th percentile | 2.18 |
| Q1 | 5.19 |
| median | 9.69 |
| Q3 | 18.1 |
| 95-th percentile | 21.89 |
| Maximum | 27.74 |
| Range | 27.28 |
| Interquartile range (IQR) | 12.91 |
Descriptive statistics
| Standard deviation | 6.860352941 |
|---|---|
| Coefficient of variation (CV) | 0.6160087358 |
| Kurtosis | -1.233539601 |
| Mean | 11.13677866 |
| Median Absolute Deviation (MAD) | 6.32 |
| Skewness | 0.2950215679 |
| Sum | 5635.21 |
| Variance | 47.06444247 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 18.1 | 132 | |
| 19.58 | 30 | 5.9% |
| 8.14 | 22 | 4.3% |
| 6.2 | 18 | 3.6% |
| 21.89 | 15 | 3.0% |
| 9.9 | 12 | 2.4% |
| 3.97 | 12 | 2.4% |
| 10.59 | 11 | 2.2% |
| 8.56 | 11 | 2.2% |
| 5.86 | 10 | 2.0% |
| Other values (66) | 233 |
| Value | Count | Frequency (%) |
| 0.46 | 1 | 0.2% |
| 0.74 | 1 | 0.2% |
| 1.21 | 1 | 0.2% |
| 1.22 | 1 | 0.2% |
| 1.25 | 2 | |
| 1.32 | 1 | 0.2% |
| 1.38 | 1 | 0.2% |
| 1.47 | 2 | |
| 1.52 | 4 | |
| 1.69 | 2 |
| Value | Count | Frequency (%) |
| 27.74 | 5 | 1.0% |
| 25.65 | 7 | 1.4% |
| 21.89 | 15 | 3.0% |
| 19.58 | 30 | 5.9% |
| 18.1 | 132 | |
| 15.04 | 3 | 0.6% |
| 13.92 | 5 | 1.0% |
| 13.89 | 4 | 0.8% |
| 12.83 | 6 | 1.2% |
| 11.93 | 5 | 1.0% |
CHAS
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.8 KiB |
| 0 | |
|---|---|
| 1 | 35 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 506 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 471 | |
| 1 | 35 | 6.9% |
| Value | Count | Frequency (%) |
| 0 | 471 | |
| 1 | 35 | 6.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 471 | |
| 1 | 35 | 6.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 506 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 471 | |
| 1 | 35 | 6.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 506 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 471 | |
| 1 | 35 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 506 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 471 | |
| 1 | 35 | 6.9% |
NOX
Real number (ℝ≥0)
| Distinct | 81 |
|---|---|
| Distinct (%) | 16.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5546950593 |
|---|---|
| Minimum | 0.385 |
| Maximum | 0.871 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0.385 |
|---|---|
| 5-th percentile | 0.40925 |
| Q1 | 0.449 |
| median | 0.538 |
| Q3 | 0.624 |
| 95-th percentile | 0.74 |
| Maximum | 0.871 |
| Range | 0.486 |
| Interquartile range (IQR) | 0.175 |
Descriptive statistics
| Standard deviation | 0.1158776757 |
|---|---|
| Coefficient of variation (CV) | 0.2089033853 |
| Kurtosis | -0.06466713337 |
| Mean | 0.5546950593 |
| Median Absolute Deviation (MAD) | 0.0875 |
| Skewness | 0.7293079225 |
| Sum | 280.6757 |
| Variance | 0.01342763572 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.538 | 23 | 4.5% |
| 0.713 | 18 | 3.6% |
| 0.437 | 17 | 3.4% |
| 0.871 | 16 | 3.2% |
| 0.624 | 15 | 3.0% |
| 0.489 | 15 | 3.0% |
| 0.605 | 14 | 2.8% |
| 0.693 | 14 | 2.8% |
| 0.74 | 13 | 2.6% |
| 0.544 | 12 | 2.4% |
| Other values (71) | 349 |
| Value | Count | Frequency (%) |
| 0.385 | 1 | 0.2% |
| 0.389 | 1 | 0.2% |
| 0.392 | 2 | |
| 0.394 | 1 | 0.2% |
| 0.398 | 2 | |
| 0.4 | 4 | |
| 0.401 | 3 | |
| 0.403 | 3 | |
| 0.404 | 3 | |
| 0.405 | 3 |
| Value | Count | Frequency (%) |
| 0.871 | 16 | |
| 0.77 | 8 | |
| 0.74 | 13 | |
| 0.718 | 6 | 1.2% |
| 0.713 | 18 | |
| 0.7 | 11 | |
| 0.693 | 14 | |
| 0.679 | 8 | |
| 0.671 | 7 | 1.4% |
| 0.668 | 3 | 0.6% |
RM
Real number (ℝ≥0)
| Distinct | 446 |
|---|---|
| Distinct (%) | 88.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.284634387 |
|---|---|
| Minimum | 3.561 |
| Maximum | 8.78 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 3.561 |
|---|---|
| 5-th percentile | 5.314 |
| Q1 | 5.8855 |
| median | 6.2085 |
| Q3 | 6.6235 |
| 95-th percentile | 7.5875 |
| Maximum | 8.78 |
| Range | 5.219 |
| Interquartile range (IQR) | 0.738 |
Descriptive statistics
| Standard deviation | 0.7026171434 |
|---|---|
| Coefficient of variation (CV) | 0.1117992074 |
| Kurtosis | 1.891500366 |
| Mean | 6.284634387 |
| Median Absolute Deviation (MAD) | 0.3455 |
| Skewness | 0.4036121333 |
| Sum | 3180.025 |
| Variance | 0.4936708502 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.167 | 3 | 0.6% |
| 6.405 | 3 | 0.6% |
| 5.713 | 3 | 0.6% |
| 6.417 | 3 | 0.6% |
| 6.127 | 3 | 0.6% |
| 6.229 | 3 | 0.6% |
| 5.39 | 2 | 0.4% |
| 5.304 | 2 | 0.4% |
| 6.968 | 2 | 0.4% |
| 6.009 | 2 | 0.4% |
| Other values (436) | 480 |
| Value | Count | Frequency (%) |
| 3.561 | 1 | |
| 3.863 | 1 | |
| 4.138 | 2 | |
| 4.368 | 1 | |
| 4.519 | 1 | |
| 4.628 | 1 | |
| 4.652 | 1 | |
| 4.88 | 1 | |
| 4.903 | 1 | |
| 4.906 | 1 |
| Value | Count | Frequency (%) |
| 8.78 | 1 | |
| 8.725 | 1 | |
| 8.704 | 1 | |
| 8.398 | 1 | |
| 8.375 | 1 | |
| 8.337 | 1 | |
| 8.297 | 1 | |
| 8.266 | 1 | |
| 8.259 | 1 | |
| 8.247 | 1 |
AGE
Real number (ℝ≥0)
| Distinct | 356 |
|---|---|
| Distinct (%) | 70.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 68.57490119 |
|---|---|
| Minimum | 2.9 |
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 2.9 |
|---|---|
| 5-th percentile | 17.725 |
| Q1 | 45.025 |
| median | 77.5 |
| Q3 | 94.075 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 97.1 |
| Interquartile range (IQR) | 49.05 |
Descriptive statistics
| Standard deviation | 28.14886141 |
|---|---|
| Coefficient of variation (CV) | 0.410483441 |
| Kurtosis | -0.9677155942 |
| Mean | 68.57490119 |
| Median Absolute Deviation (MAD) | 19.55 |
| Skewness | -0.5989626399 |
| Sum | 34698.9 |
| Variance | 792.3583985 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 43 | 8.5% |
| 97.9 | 4 | 0.8% |
| 96 | 4 | 0.8% |
| 95.4 | 4 | 0.8% |
| 98.2 | 4 | 0.8% |
| 87.9 | 4 | 0.8% |
| 98.8 | 4 | 0.8% |
| 97.4 | 3 | 0.6% |
| 94.1 | 3 | 0.6% |
| 96.2 | 3 | 0.6% |
| Other values (346) | 430 |
| Value | Count | Frequency (%) |
| 2.9 | 1 | |
| 6 | 1 | |
| 6.2 | 1 | |
| 6.5 | 1 | |
| 6.6 | 2 | |
| 6.8 | 1 | |
| 7.8 | 2 | |
| 8.4 | 1 | |
| 8.9 | 1 | |
| 9.8 | 1 |
| Value | Count | Frequency (%) |
| 100 | 43 | |
| 99.3 | 1 | 0.2% |
| 99.1 | 1 | 0.2% |
| 98.9 | 3 | 0.6% |
| 98.8 | 4 | 0.8% |
| 98.7 | 1 | 0.2% |
| 98.5 | 1 | 0.2% |
| 98.4 | 2 | 0.4% |
| 98.3 | 2 | 0.4% |
| 98.2 | 4 | 0.8% |
DIS
Real number (ℝ≥0)
| Distinct | 412 |
|---|---|
| Distinct (%) | 81.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.795042688 |
|---|---|
| Minimum | 1.1296 |
| Maximum | 12.1265 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 1.1296 |
|---|---|
| 5-th percentile | 1.461975 |
| Q1 | 2.100175 |
| median | 3.20745 |
| Q3 | 5.188425 |
| 95-th percentile | 7.8278 |
| Maximum | 12.1265 |
| Range | 10.9969 |
| Interquartile range (IQR) | 3.08825 |
Descriptive statistics
| Standard deviation | 2.105710127 |
|---|---|
| Coefficient of variation (CV) | 0.5548580872 |
| Kurtosis | 0.4879411222 |
| Mean | 3.795042688 |
| Median Absolute Deviation (MAD) | 1.29115 |
| Skewness | 1.011780579 |
| Sum | 1920.2916 |
| Variance | 4.434015137 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.4952 | 5 | 1.0% |
| 5.7209 | 4 | 0.8% |
| 5.2873 | 4 | 0.8% |
| 6.8147 | 4 | 0.8% |
| 5.4007 | 4 | 0.8% |
| 7.8278 | 3 | 0.6% |
| 3.9454 | 3 | 0.6% |
| 7.309 | 3 | 0.6% |
| 5.4917 | 3 | 0.6% |
| 6.4798 | 3 | 0.6% |
| Other values (402) | 470 |
| Value | Count | Frequency (%) |
| 1.1296 | 1 | |
| 1.137 | 1 | |
| 1.1691 | 1 | |
| 1.1742 | 1 | |
| 1.1781 | 1 | |
| 1.2024 | 1 | |
| 1.2852 | 1 | |
| 1.3163 | 1 | |
| 1.3216 | 1 | |
| 1.3325 | 1 |
| Value | Count | Frequency (%) |
| 12.1265 | 1 | |
| 10.7103 | 2 | |
| 10.5857 | 2 | |
| 9.2229 | 1 | |
| 9.2203 | 2 | |
| 9.1876 | 1 | |
| 9.0892 | 1 | |
| 8.9067 | 2 | |
| 8.7921 | 2 | |
| 8.6966 | 1 |
| Distinct | 9 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.549407115 |
|---|---|
| Minimum | 1 |
| Maximum | 24 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 5 |
| Q3 | 24 |
| 95-th percentile | 24 |
| Maximum | 24 |
| Range | 23 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 8.707259384 |
|---|---|
| Coefficient of variation (CV) | 0.9118115166 |
| Kurtosis | -0.8672319936 |
| Mean | 9.549407115 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.004814648 |
| Sum | 4832 |
| Variance | 75.81636598 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 24 | 132 | |
| 5 | 115 | |
| 4 | 110 | |
| 3 | 38 | 7.5% |
| 6 | 26 | 5.1% |
| 2 | 24 | 4.7% |
| 8 | 24 | 4.7% |
| 1 | 20 | 4.0% |
| 7 | 17 | 3.4% |
| Value | Count | Frequency (%) |
| 1 | 20 | 4.0% |
| 2 | 24 | 4.7% |
| 3 | 38 | 7.5% |
| 4 | 110 | |
| 5 | 115 | |
| 6 | 26 | 5.1% |
| 7 | 17 | 3.4% |
| 8 | 24 | 4.7% |
| 24 | 132 |
| Value | Count | Frequency (%) |
| 24 | 132 | |
| 8 | 24 | 4.7% |
| 7 | 17 | 3.4% |
| 6 | 26 | 5.1% |
| 5 | 115 | |
| 4 | 110 | |
| 3 | 38 | 7.5% |
| 2 | 24 | 4.7% |
| 1 | 20 | 4.0% |
| Distinct | 66 |
|---|---|
| Distinct (%) | 13.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 408.2371542 |
|---|---|
| Minimum | 187 |
| Maximum | 711 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 187 |
|---|---|
| 5-th percentile | 222 |
| Q1 | 279 |
| median | 330 |
| Q3 | 666 |
| 95-th percentile | 666 |
| Maximum | 711 |
| Range | 524 |
| Interquartile range (IQR) | 387 |
Descriptive statistics
| Standard deviation | 168.5371161 |
|---|---|
| Coefficient of variation (CV) | 0.4128411987 |
| Kurtosis | -1.142407992 |
| Mean | 408.2371542 |
| Median Absolute Deviation (MAD) | 73 |
| Skewness | 0.6699559418 |
| Sum | 206568 |
| Variance | 28404.75949 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 666 | 132 | |
| 307 | 40 | 7.9% |
| 403 | 30 | 5.9% |
| 437 | 15 | 3.0% |
| 304 | 14 | 2.8% |
| 264 | 12 | 2.4% |
| 398 | 12 | 2.4% |
| 384 | 11 | 2.2% |
| 277 | 11 | 2.2% |
| 330 | 10 | 2.0% |
| Other values (56) | 219 |
| Value | Count | Frequency (%) |
| 187 | 1 | 0.2% |
| 188 | 7 | |
| 193 | 8 | |
| 198 | 1 | 0.2% |
| 216 | 5 | |
| 222 | 7 | |
| 223 | 5 | |
| 224 | 10 | |
| 226 | 1 | 0.2% |
| 233 | 9 |
| Value | Count | Frequency (%) |
| 711 | 5 | 1.0% |
| 666 | 132 | |
| 469 | 1 | 0.2% |
| 437 | 15 | 3.0% |
| 432 | 9 | 1.8% |
| 430 | 3 | 0.6% |
| 422 | 1 | 0.2% |
| 411 | 2 | 0.4% |
| 403 | 30 | 5.9% |
| 402 | 2 | 0.4% |
PTRATIO
Real number (ℝ≥0)
| Distinct | 46 |
|---|---|
| Distinct (%) | 9.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.4555336 |
|---|---|
| Minimum | 12.6 |
| Maximum | 22 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 12.6 |
|---|---|
| 5-th percentile | 14.7 |
| Q1 | 17.4 |
| median | 19.05 |
| Q3 | 20.2 |
| 95-th percentile | 21 |
| Maximum | 22 |
| Range | 9.4 |
| Interquartile range (IQR) | 2.8 |
Descriptive statistics
| Standard deviation | 2.164945524 |
|---|---|
| Coefficient of variation (CV) | 0.1173060379 |
| Kurtosis | -0.2850913833 |
| Mean | 18.4555336 |
| Median Absolute Deviation (MAD) | 1.15 |
| Skewness | -0.8023249269 |
| Sum | 9338.5 |
| Variance | 4.686989121 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 20.2 | 140 | |
| 14.7 | 34 | 6.7% |
| 21 | 27 | 5.3% |
| 17.8 | 23 | 4.5% |
| 19.2 | 19 | 3.8% |
| 17.4 | 18 | 3.6% |
| 18.6 | 17 | 3.4% |
| 19.1 | 17 | 3.4% |
| 16.6 | 16 | 3.2% |
| 18.4 | 16 | 3.2% |
| Other values (36) | 179 |
| Value | Count | Frequency (%) |
| 12.6 | 3 | 0.6% |
| 13 | 12 | 2.4% |
| 13.6 | 1 | 0.2% |
| 14.4 | 1 | 0.2% |
| 14.7 | 34 | |
| 14.8 | 3 | 0.6% |
| 14.9 | 4 | 0.8% |
| 15.1 | 1 | 0.2% |
| 15.2 | 13 | 2.6% |
| 15.3 | 3 | 0.6% |
| Value | Count | Frequency (%) |
| 22 | 2 | 0.4% |
| 21.2 | 15 | 3.0% |
| 21.1 | 1 | 0.2% |
| 21 | 27 | 5.3% |
| 20.9 | 11 | 2.2% |
| 20.2 | 140 | |
| 20.1 | 5 | 1.0% |
| 19.7 | 8 | 1.6% |
| 19.6 | 8 | 1.6% |
| 19.2 | 19 | 3.8% |
B
Real number (ℝ≥0)
| Distinct | 357 |
|---|---|
| Distinct (%) | 70.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 356.6740316 |
|---|---|
| Minimum | 0.32 |
| Maximum | 396.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 0.32 |
|---|---|
| 5-th percentile | 84.59 |
| Q1 | 375.3775 |
| median | 391.44 |
| Q3 | 396.225 |
| 95-th percentile | 396.9 |
| Maximum | 396.9 |
| Range | 396.58 |
| Interquartile range (IQR) | 20.8475 |
Descriptive statistics
| Standard deviation | 91.29486438 |
|---|---|
| Coefficient of variation (CV) | 0.255961624 |
| Kurtosis | 7.226817549 |
| Mean | 356.6740316 |
| Median Absolute Deviation (MAD) | 5.46 |
| Skewness | -2.890373712 |
| Sum | 180477.06 |
| Variance | 8334.752263 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 396.9 | 121 | 23.9% |
| 395.24 | 3 | 0.6% |
| 393.74 | 3 | 0.6% |
| 394.12 | 2 | 0.4% |
| 395.56 | 2 | 0.4% |
| 390.94 | 2 | 0.4% |
| 388.45 | 2 | 0.4% |
| 393.23 | 2 | 0.4% |
| 396.21 | 2 | 0.4% |
| 393.37 | 2 | 0.4% |
| Other values (347) | 365 |
| Value | Count | Frequency (%) |
| 0.32 | 1 | |
| 2.52 | 1 | |
| 2.6 | 1 | |
| 3.5 | 1 | |
| 3.65 | 1 | |
| 6.68 | 1 | |
| 7.68 | 1 | |
| 9.32 | 1 | |
| 10.48 | 1 | |
| 16.45 | 1 |
| Value | Count | Frequency (%) |
| 396.9 | 121 | |
| 396.42 | 1 | 0.2% |
| 396.33 | 1 | 0.2% |
| 396.3 | 1 | 0.2% |
| 396.28 | 1 | 0.2% |
| 396.24 | 1 | 0.2% |
| 396.23 | 1 | 0.2% |
| 396.21 | 2 | 0.4% |
| 396.14 | 1 | 0.2% |
| 396.06 | 2 | 0.4% |
LSTAT
Real number (ℝ≥0)
| Distinct | 455 |
|---|---|
| Distinct (%) | 89.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.65306324 |
|---|---|
| Minimum | 1.73 |
| Maximum | 37.97 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.1 KiB |
Quantile statistics
| Minimum | 1.73 |
|---|---|
| 5-th percentile | 3.7075 |
| Q1 | 6.95 |
| median | 11.36 |
| Q3 | 16.955 |
| 95-th percentile | 26.8075 |
| Maximum | 37.97 |
| Range | 36.24 |
| Interquartile range (IQR) | 10.005 |
Descriptive statistics
| Standard deviation | 7.141061511 |
|---|---|
| Coefficient of variation (CV) | 0.5643741263 |
| Kurtosis | 0.4932395174 |
| Mean | 12.65306324 |
| Median Absolute Deviation (MAD) | 4.795 |
| Skewness | 0.9064600936 |
| Sum | 6402.45 |
| Variance | 50.99475951 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 8.05 | 3 | 0.6% |
| 6.36 | 3 | 0.6% |
| 18.13 | 3 | 0.6% |
| 14.1 | 3 | 0.6% |
| 7.79 | 3 | 0.6% |
| 18.46 | 2 | 0.4% |
| 9.97 | 2 | 0.4% |
| 5.33 | 2 | 0.4% |
| 10.45 | 2 | 0.4% |
| 6.72 | 2 | 0.4% |
| Other values (445) | 481 |
| Value | Count | Frequency (%) |
| 1.73 | 1 | |
| 1.92 | 1 | |
| 1.98 | 1 | |
| 2.47 | 1 | |
| 2.87 | 1 | |
| 2.88 | 1 | |
| 2.94 | 1 | |
| 2.96 | 1 | |
| 2.97 | 1 | |
| 2.98 | 1 |
| Value | Count | Frequency (%) |
| 37.97 | 1 | |
| 36.98 | 1 | |
| 34.77 | 1 | |
| 34.41 | 1 | |
| 34.37 | 1 | |
| 34.02 | 1 | |
| 31.99 | 1 | |
| 30.81 | 2 | |
| 30.63 | 1 | |
| 30.62 | 1 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| TOWN | TOWNNO | TRACT | LON | LAT | MEDV | CMEDV | CRIM | ZN | INDUS | CHAS | NOX | RM | AGE | DIS | RAD | TAX | PTRATIO | B | LSTAT | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Nahant | 0 | 2011 | -70.9550 | 42.2550 | 24.0 | 24.0 | 0.00632 | 18.0 | 2.31 | 0 | 0.538 | 6.575 | 65.2 | 4.0900 | 1 | 296 | 15.3 | 396.90 | 4.98 |
| 1 | Swampscott | 1 | 2021 | -70.9500 | 42.2875 | 21.6 | 21.6 | 0.02731 | 0.0 | 7.07 | 0 | 0.469 | 6.421 | 78.9 | 4.9671 | 2 | 242 | 17.8 | 396.90 | 9.14 |
| 2 | Swampscott | 1 | 2022 | -70.9360 | 42.2830 | 34.7 | 34.7 | 0.02729 | 0.0 | 7.07 | 0 | 0.469 | 7.185 | 61.1 | 4.9671 | 2 | 242 | 17.8 | 392.83 | 4.03 |
| 3 | Marblehead | 2 | 2031 | -70.9280 | 42.2930 | 33.4 | 33.4 | 0.03237 | 0.0 | 2.18 | 0 | 0.458 | 6.998 | 45.8 | 6.0622 | 3 | 222 | 18.7 | 394.63 | 2.94 |
| 4 | Marblehead | 2 | 2032 | -70.9220 | 42.2980 | 36.2 | 36.2 | 0.06905 | 0.0 | 2.18 | 0 | 0.458 | 7.147 | 54.2 | 6.0622 | 3 | 222 | 18.7 | 396.90 | 5.33 |
| 5 | Marblehead | 2 | 2033 | -70.9165 | 42.3040 | 28.7 | 28.7 | 0.02985 | 0.0 | 2.18 | 0 | 0.458 | 6.430 | 58.7 | 6.0622 | 3 | 222 | 18.7 | 394.12 | 5.21 |
| 6 | Salem | 3 | 2041 | -70.9360 | 42.2970 | 22.9 | 22.9 | 0.08829 | 12.5 | 7.87 | 0 | 0.524 | 6.012 | 66.6 | 5.5605 | 5 | 311 | 15.2 | 395.60 | 12.43 |
| 7 | Salem | 3 | 2042 | -70.9375 | 42.3100 | 27.1 | 22.1 | 0.14455 | 12.5 | 7.87 | 0 | 0.524 | 6.172 | 96.1 | 5.9505 | 5 | 311 | 15.2 | 396.90 | 19.15 |
| 8 | Salem | 3 | 2043 | -70.9330 | 42.3120 | 16.5 | 16.5 | 0.21124 | 12.5 | 7.87 | 0 | 0.524 | 5.631 | 100.0 | 6.0821 | 5 | 311 | 15.2 | 386.63 | 29.93 |
| 9 | Salem | 3 | 2044 | -70.9290 | 42.3160 | 18.9 | 18.9 | 0.17004 | 12.5 | 7.87 | 0 | 0.524 | 6.004 | 85.9 | 6.5921 | 5 | 311 | 15.2 | 386.71 | 17.10 |
Last rows
| TOWN | TOWNNO | TRACT | LON | LAT | MEDV | CMEDV | CRIM | ZN | INDUS | CHAS | NOX | RM | AGE | DIS | RAD | TAX | PTRATIO | B | LSTAT | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 496 | Revere | 90 | 1704 | -71.0010 | 42.2525 | 19.7 | 19.7 | 0.28960 | 0.0 | 9.69 | 0 | 0.585 | 5.390 | 72.9 | 2.7986 | 6 | 391 | 19.2 | 396.90 | 21.14 |
| 497 | Revere | 90 | 1705 | -70.9947 | 42.2496 | 18.3 | 18.3 | 0.26838 | 0.0 | 9.69 | 0 | 0.585 | 5.794 | 70.6 | 2.8927 | 6 | 391 | 19.2 | 396.90 | 14.10 |
| 498 | Revere | 90 | 1706 | -71.0050 | 42.2455 | 21.2 | 21.2 | 0.23912 | 0.0 | 9.69 | 0 | 0.585 | 6.019 | 65.3 | 2.4091 | 6 | 391 | 19.2 | 396.90 | 12.92 |
| 499 | Revere | 90 | 1707 | -70.9985 | 42.2430 | 17.5 | 17.5 | 0.17783 | 0.0 | 9.69 | 0 | 0.585 | 5.569 | 73.5 | 2.3999 | 6 | 391 | 19.2 | 395.77 | 15.10 |
| 500 | Revere | 90 | 1708 | -70.9920 | 42.2380 | 16.8 | 16.8 | 0.22438 | 0.0 | 9.69 | 0 | 0.585 | 6.027 | 79.7 | 2.4982 | 6 | 391 | 19.2 | 396.90 | 14.33 |
| 501 | Winthrop | 91 | 1801 | -70.9860 | 42.2312 | 22.4 | 22.4 | 0.06263 | 0.0 | 11.93 | 0 | 0.573 | 6.593 | 69.1 | 2.4786 | 1 | 273 | 21.0 | 391.99 | 9.67 |
| 502 | Winthrop | 91 | 1802 | -70.9910 | 42.2275 | 20.6 | 20.6 | 0.04527 | 0.0 | 11.93 | 0 | 0.573 | 6.120 | 76.7 | 2.2875 | 1 | 273 | 21.0 | 396.90 | 9.08 |
| 503 | Winthrop | 91 | 1803 | -70.9948 | 42.2260 | 23.9 | 23.9 | 0.06076 | 0.0 | 11.93 | 0 | 0.573 | 6.976 | 91.0 | 2.1675 | 1 | 273 | 21.0 | 396.90 | 5.64 |
| 504 | Winthrop | 91 | 1804 | -70.9875 | 42.2240 | 22.0 | 22.0 | 0.10959 | 0.0 | 11.93 | 0 | 0.573 | 6.794 | 89.3 | 2.3889 | 1 | 273 | 21.0 | 393.45 | 6.48 |
| 505 | Winthrop | 91 | 1805 | -70.9825 | 42.2210 | 11.9 | 19.0 | 0.04741 | 0.0 | 11.93 | 0 | 0.573 | 6.030 | 80.8 | 2.5050 | 1 | 273 | 21.0 | 396.90 | 7.88 |